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^ ', Abstract 

J>y Consider the following online version of the submodular maximization problem under a matroid 

constraint: We are given a set of elements over which a matroid is defined. The goal is to incrementally 
choose a subset that remains independent in the matroid over time. At each time, a new weighted rank 
function of a different matroid (one per time) over the same elements is presented; the algorithm can 
add a few elements to the incrementally constructed set, and reaps a reward equal to the value of the 
new weighted rank function on the current set. The goal of the algorithm as it builds this independent 
set online is to maximize the sum of these (weighted rank) rewards. As in regular online analysis, we 
£^ | compare the rewards of our online algorithm to that of an offline optimum, namely a single independent 

set of the matroid that maximizes the sum of the weighted rank rewards that arrive over time. This 
O | problem is a natural extension of two well-studied streams of earlier work: the first is on online set cover 

algorithms (in particular for the max coverage version) while the second is on approximately maximizing 
submodular functions under a matroid constraint. 

In this paper, we present the first randomized online algorithms for this problem with poly-logarithmic 
competitive ratio. To do this, we employ the LP formulation of a scaled reward version of the problem. 
Then we extend a weighted-majority type update rule along with uncrossing properties of tight sets in 
the matroid polytope to find an approximately optimal fractional LP solution. We use the fractional 
solution values as probabilities for a online randomized rounding algorithm. To show that our round- 
ing produces a sufficiently large reward independent set, we prove and use new covering properties for 



in 
o 



^vq . randomly rounded fractional solutions in the matroid polytope that may be of independent interest. 

1 Introduction 



x 



Making decisions in the face of uncertainty is the fundamental problem addressed by online computation 
[|5]]. In many planning scenarios, a planner must decide on the evolution of features to a product without 
knowing the evolution of the demand for these features from future users. Moreover, any features initially 
included must be retained for backward compatibility, and hence leads to an online optimization problem: 
given a set of features, the planner must phase the addition of the features, so as to maximize the value 
perceived by a user at the time of arrival. Typically, users have diminishing returns for additional features, 
so it is natural to represent their utility as a submodular function of the features that are present (or added) 
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when they arrive. Furthermore, the set of features that are thus monotonically added, are typically required 
to obey some design constraints. The simplest are of the form that partition the features into classes and there 
is a restriction on the number of features that can be deployed in each class. A slight extension specifies a 
hierarchy over these classes and there are individual bounds over the number of features that can be chosen 
from each class. We capture these, as well as other much more general restrictions on the set of deployed 
features, via the constraint that the chosen features form an independent set of a matroid. Thus, our problem 
is to monotonically construct an independent set of features (from a matroid over the features) online, so as 
to maximize the sum of submodular function values (users) arriving over time and evaluated on the set of 
features that have been constructed so far. 

This class of online optimization problems generalizes some early work of Awerbuch et al. [2]. They 
considered a set-cover instance, in which the restriction is to choose at most k sets with the goal of maximiz- 
ing the coverage of the elements as they arrive over time. This is precisely the online maximization version 
of the well-studied maximum coverage problem. Even this special case of our problem already abstracts 
problems in investment planning, strategic planning, and video-on-demand scheduling. 



1.1 Problem Setting, Main Result and Techniques 



In our setting^ we are given a universe of elements E, \E\ = m, and a matroid M. = (E,I(A4)) whose 
independent sets characterize the limitations on which sets of elements we can choose. At every time step i, 
1 < i < n, a client arrives with a non-negative monotone submodular function /j : 2 E — > Z + representing 
her welfare function. The objective is to maintain a monotonically increasing set F € Z{M) over time; 
that is, the set of elements (at time i — 1) can only be augmented to Fi after seeing /j at time step 
i. The welfare of client i is then fi(Fi), and our objective is to maximize fii.Fi)- We compare our 

performance to the offline optimum raaxoei(M) ^27=1 fi(0)- 

In this paper, we are concerned with the case when each of the submodular functions fi is a weighted 
rank function of some matroid Mi, i.e., fi(S) = ^3^icsj&i{Mr) 2~2eei w i,e wnere w i '■ E — > R+ is an 
arbitrary weight function. This class of submodular functions is very broad and includes all the examples 
discussed above; Furthermore, we believe it captures the difficulty of general submodular functions even 
though we have not yet been able to extend our results to the general case. Nevertheless, there are sub- 
modular functions which are not weighted rank functions of a matroid, for example, multi-set coverage 
function Q. 

Theorem 1.1. There exists a randomized polynomial time algorithm which is O (log 2 n log m log f ratio) - 
competitive, for the online submodular function maximization problem under a matroid constraint over m el- 
ements, when each fi, 1 < i < n, is a weighted rank function of a matroid and f ra tio = 2 " lax ' ' '' f 1 



'nuni.el/^e^o/iCfe})' 

In other words, the algorithm maintains monotonically increasing independent sets Fi 6 X(A4) such that 



E 



>tt( — - )• max yVi(O). 

Vlog 2 nlogmlog/ ra4io y Oel(M)^ 



i=l 



Our result should be contrasted with the lower bound proved in [2 



Lemma 1.2. [2] Any randomized algorithm for the submodular maximization problem under a matroid 
constraint is Q(\og nlog(m/r))-competitive, where r is the rank of the matroid. This lower bound holds 
even for uniform matroids and when all fi are unweighted rank functions. 



'For preliminaries and basic definitions, please see Section[2] 

2 The lower bounds in (2) relate even to a special case of uniform matroid and very restricted sub-modular functions. 
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We note that the O(logm) factor in our analysis can be improved slightly to an 0(log(m/r)) factor 
with a more careful analysis. A lower bound of f2(log f ratio) also follows even when the functions /, are 
linear (see, for example, 0). 

Main Techniques. To prove our results, we combine techniques from online computation and combinatorial 
optimization. The first step is to formulate an integer linear programming formulation for the problem. 
Unfortunately, the natural linear program is not well-suited for the online version of the problem. Thus, we 
formulate a different linear program in which we add an extra constraint that each element e contributes 
roughly the same value to the objective of the optimal solution. While this may not be true in general, we 
show that an approximate optimal solution satisfies this requirement. 

We note that the online setting we study is quite different from the online packing framework studied by 
ll6l and leads to new technical challenges. In particular, there are two obstacles in applying the primal-dual 
techniques in [ 6 ] to our setting. First, the linear formulation we obtain (that is natural for our problem) is 
not a strict packing LP and contains negative variables (See Section [3]). Second, the number of packing 
constraints is exponential, and hence the techniques of [6] would give a linear competitive factor rather than 
a polylogarifhmic one. Nevertheless, we present in Section [3] an online algorithm which gives a fractional 
solution to the linear program having a large objective value. One of the crucial ingredients is the uncrossing 
property of tight sets for any feasible point in the matroid polytope. 

To obtain an integral solution, we perform in SectionHJa natural randomized rounding procedure to select 
fractionally chosen elements. But, we have to be careful to maintain that the selected elements continue to 
form an independent set. The main challenge in the analysis is to tie the performance of the randomized 
algorithm to the performance of the fractional algorithm. As a technical tool in our proof, we show in 
Lemma l4~4l that randomly rounding a fractional solution in the matroid polytope gives a set which can be 
covered by 0(log n) independent sets with high probability. This lemma may be of independent interest and 
similar in flavor to the results of Karger [14] who proved a similar result for packing bases in the randomly 
rounded solution. 

1.2 Related results 

Maximizing monotone submodular function under matroid constraints has been a well studied problem 
and even many special cases have been studied widely (see survey by Goundan and Schulz |13]). Fisher, 
Nemhauser and Wolsey lfl2l gave a (1 — ^-approximation when the matroid is the uniform matroid and 
showed that the greedy algorithm gives a | -approximation. This was improved by Calinescu at al Q 
and Vondrak [20] who gave a (1 — ^-approximation for the general problem. They also introduced the 
multi-linear extension of a submodular function and used pipage rounding introduced by Ageev and Sviri- 
denko [1J. The facility location problem was introduced by Cornuejols et al. [9] and was the impetus behind 
studying the general submodular function maximization problem subject to matroid constraints. The sub- 
modular welfare problem can be cast as a submodular maximization problem subject to a matroid constraint 
and the reduction appears in Fisher et al. [12] and the problem has been extensively studied |[T8l[T6l[T7l[T5l . 
The result of Vondrak [20] implies a (1 — ~) -approximation for the problem. Despite the restricted setting 
of our benefit functions, we note that recent work in welfare maximization in combinatorial auctions iflOl 
has focused on precisely the case when the valuations are matroid rank sums (MRS) that we consider in our 
model. 

A special case of our online problem was studied by Awerbuch et al. JH. They studied an online variant 
of the max-coverage problem, where given n sets covering m elements, the elements arrive one at a time, 
and the goal is to pick up to k sets online to maximize coverage. They obtained a randomized algorithm 
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that is 0(log n log (m/k)) -competitive for the problem and proved that this is optimal in their setting. Our 
results generalize both the requirement on the cardinality of the chosen sets to arbitrary matroid constraints, 
and the coverage functions of the arriving elements to monotone submodular functions that are weighted 
rank functions of matroids. 

Another closely related problem with a different model of uncertainty was studied by Babaioff et al. 
0. They studied a setting in which elements of a matroid arrive in an online fashion and the goal is to 
construct an independent set that is competitive with the maximum weight independent set. They considered 
the random permutation model which is a non-adversarial setting, and obtained an O (log k) -competitive 
algorithm for general matroid, where k is the rank of the matroid, and constant competitive ratio for several 
interesting matroids. Recently, Bateni et al. studied the same model where the objective function is a 
submodular function (rather than linear). 

Chawla et al. [8] study Bayesian optimal mechanism design to maximize expected revenue for a seller 
while allocating items to agents who draw their values for the items from a known distribution. Their devel- 
opment of agent-specific posted price mechanisms when the agents arrive in order, and the items allocated 
must obey matroid feasibility constraints, is similar to our setting. In particular, we use the ideas about 
certain ordering of matroid elements (Lemma 7 in their paper) in the proof that our randomized rounding 
algorithm give sufficient profit. 

2 Preliminaries 

Given a set E, a function / : 2 E — > R + is called submodular if for all sets A, B C E, f(A) + f(B) > 
f(AnB)+f(AuB). Given set E and a collection 1 C 2 E , M = (E, 1(M)) is a matroid if (i) for all A el 
and B C A implies that B € T and (ii) for all A,B £ 1 and \A\ > \B\ then there exists a £ A \ B such that 
B U {a} € I. Sets in X are called independent sets of the matroid M.. The rank function r : 2 E — > R + of 
matroid M. is defined as r(S) = max.T^X:Tcs \T\. A standard property of matroids is the fact that the rank 
function of any matroid is submodular. 

We also work with weighted rank functions of a matroid, defined as f(S) = m&-x. ICS I( zx(M) See/ We 
for some weight function w : 2 E — > R + . Given any matroid M., we define the matroid poly tope to be 
the convex hull of independent sets P(A4) = conv{li : I g 1} C Rl E L Edmonds ifTTll showed that 
P{M) = {x > : x(S) < r(S) V S C E}. We also use the following fact about fractional points in the 
matroid polytope (The proof follows from standard uncrossing arguments. See Schrijver [191, Chapter 40). 

Fact 2.1. Given a matroid A4 = (E,I(M)) with rank function r and feasible point x E P(A4), let 
t = {S C E : x(S) = r(S)}. Then, r is closed under intersection and union and there is a single maximal 
set in t. 

3 Linear Program and the Fractional Algorithm 

We now give a linear program for the online submodular function maximization problem and show how 
to construct a feasible fractional solution online which is 0(logm log n log f ratio) -competitive. Before we 
give the main theorem, we first formulate a natural LP. Let O C E denote the optimal solution with the 
objective Yl7=l fi(Q)- Since each /j is the weighted rank function of matroid Mi, we have that fi(0) = 
Wi{Oi) = SggQ. Wi je where O 3 Oi G T(Ni). For the sake of simplicity, we assume that Wi :e = 1. In 
Section[5l we show that this assumption can be removed with a loss of 0(log f ra tio) factor in the competitive 
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LP X ■ max E"=lE e 6B 2 i,e 
s.t. 

VSCE J2eeS x e<r(S) (1) 

Vl<i<n,5CB Ee G 5^e<ri(5) (2) 

VI < i < n, e G E z i:£ < x e (3) 

VI < i < n, e G E z; L e ,x e > 



Figure 1 : LP for maximizing a sum of (unweighted) rank functions subject to matroid constraint 



LP 2 (a) : max £™ =1 £ eg £ z i,e 
s.t. 

VSCi? £ ee5 x e <r(S) (4) 

Vl<i<n,5CB EeeS z i,e < n(S) (5) 

VI < i < n, e G £^ Zi >e < x e (6) 

Ve G E Y!i=i z i,e < ax e (7) 

Ve G S E"=i > ^ (8) 

VI < i < n, e G E Z{ e , x e > 



Figure 2: A restricted LP for the submodular function maximization subject to matroid constraint 



ratio. Observe that in this case, fi(S) = ri(S), where is the rank function of matroid Mi for any set 
S C E. 

We next formulate a linear program where x e is the indicator variable for whether e G O and z\ fi is the 
indicator variable for whether e G Oj. Since O G Z(M) and Oj G Z{Mi), we have that x G P(7W) and 
£j G P{Mi) as represented by constraints (JT) and constraints (]2), respectively in Figured] 

We prove the following theorem. 

Theorem 3.1. There exists a polynomial time algorithm A that constructs a feasible fractional solution 
(x, z) online to LP\ which is 0(log n log m) -competitive. That is, the algorithm A maintains monotonically 

increasing solution (x,z) such that Y^i=i See_E z «,e = ^( \ogn\ogm ) where O is the optimal integral 
solution. 

To prove Theorem l3.ll instead of working with the natural linear program LP\, we formulate a different 
linear program. The new linear program is indexed by an integer a and places the constraints that each 
e G O occurs in [%, a] different Oj's as represented by constraints Q and d8). The parameter a will be 
defined later. 

The following lemma shows that if we pick 0(log n) different values of a then the sum of the integer 
solutions to the linear programs LP 2 (a) perform as well as the optimal solution^ 

3 We assume that the algorithm knows the value of n. In Section[5]we show how to deal with an unknown n losing an additional 
small factor. 
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Lemma 3.2. Let OPT denote the value of an optimal integral solution to linear program LP\ and let 
OPT a denote the optimal value of an optimal integral solution to the linear program LP<i{pt) for each 
a G {1,2, 4,..., 2^-1}. Then OPT <^ ae{124 ^ 2riognl} OPT a . 

Proof. Consider the optimal (integral) solution (x*, z*) to LP\. We decompose this solution in to [logra] 
integral solutions (y(a), z(a)) for each a 6 {1,2,4,..., 2r iogn l } where (y(a), z(a)) is feasible to Li-^a)- 
Set y(a) e = x* e for each e G E and each a. Consider any e G E. If a/2 < Ya=i z te — a tnen set 
z(a)i je = 1 for each i such that z* e = 1. Set all other z(a)i. e = 0. Clearly, (y(a), -z(a)) is feasible and is a 
decomposition of the solution (x*, z*) and hence sum of the objectives of (y(a), z(a)) equals the objective 
of(x*,z*). □ 

Using the above lemma, a simple averaging argument shows that for some guess a, the optimal integral 
solution to LP2(a) is within a log n factor of the optimal integral solution to LP\. Hence, we construct an 
algorithm which first guesses a and then constructs an approximate fractional solution to LP2(a). 



3.1 Online Algorithm for a Fractional LP Solution 

Given a fractional solution x, we call a set S C E tight (with respect to x) if x(S) = r(S). 



Guessing Algorithm: 




• Guess the value a £r {1, 2,4.. 


■ ,n}. 


• Run AlgG with value a. 





AlgG: 



• Initialize x e «— 1/m 2 (where m = \E\), set z^ e = for each i, e. 

• When function /j arrives, order the elements arbitrarily. 

• For each element e in order: 

• If VS\e G S, x{S) < r(S) and Zi {S) < n{S) - 1/2: 

x e ^— mm < x e ■ exp ( ^^^) , mm {r(S) — x(S \{e})}\ (9) 
[ \ a J s\ees J 

Zi, e <~ X e /2 (10) 



Using an independence oracle for each of the matroids, each of the conditions can be checked in polyno- 
mial time by reducing it to submodular function minimization (See Schrijver [ 19 1, Chapter 40) and therefore 
the running time of the algorithm is polynomial. Note that the fractional algorithm is carefully designed. 
For example, it is very reasonable to update greedily the value of Zi >e even when the algorithm does not 
update the value x e (of course, ensuring that Zi G P{Mi)). While such an algorithm does give the required 
guarantee on the performance of the fractional solution, it is not clear how to round such a solution to an 
integral solution. In particular, our algorithm for finding a fractional solution is tailored so as to allow us to 
use the values as rounding probabilities in a randomized algorithm. 

Before we continue, we define some helpful notation regarding the online algorithm. Let Xi >e (a) be the 
value of the variable x e after the arrival of user i for some guess a. Let Axj e (a) be the change in the value 
of x e when user i arrives. Let x e (a) be the value of x e at the end of the execution. Similarly, let z^ e {a) be 
the value of Zi e at the end of the execution. We start with the following lemma that follows from the update 
rule ©. 
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Lemma 3.3. For any element e € E, and guess a, 

yV e (a) > — (x e (a) 2) ( n ) 

48 log m \ m z J 

where x e (a) is the value at the end of the execution ofAlgG. 

Proof. Fix e, if a < 8 log m, then the consider the last 2^ e that we update (if there are none, the claim 
follows trivially). In this iteration Zj je <— x e (a)/2 > 16 £ — (x e (a) — ^r). Otherwise, if a > 81ogm 
then, 



y y Zi )e {a) = ^2 Xi >e (a)/2> Xj_i je (a)/2 

i=l i:Ax ije (a)>0 i:Ax i>e (a)>0 

> -r-r^ ^ Ax ie{a) = —^ (x e {a)-—A (12) 

48 log m ' 48 log m V ?ri / 

where Inequality (fT2l follows since — 1 < 3x for < x < 1, and therefore, Axj e < Xi je ■ 
exp ( 8k g m ) - < Xi, e 241 ° gm if cc ije is updated by the first condition in the equation (fl2~ll . If it is 
updated by the second condition, the inequality still holds since Axj je is even smaller. □ 

Next we prove that the solution produced by AlgG is almost feasible with respect to the optimal solution 
to LP 2 (a). 

Lemma 3.4 (Feasibility Lemma). Let (x(a), z(a)) be the fractional solution generated by AlgG at the end 
of the sequence. Then, it satisfies all constraints of LP2{a) except constraints ((Sj). 

Proof. We prove that the solution is feasible. 

Matroid constraints ©. Clearly, the algorithm never violates the matroid constraints by the second term 
in the equation (© in the algorithm. 

Constraints © and constraints ©. z^ e «— Xj je (a)/2 < x e (a)/2, thus constraints © hold. Finally, by 
the algorithm behavior we only update Zj e if for all S\e € S, Zi{S) < ri(S) — 1/2. Since by the above 
observations Zj e < x e (a)/2 < 1/2, we never violate constraints (O after the update. 

Constraints ©. This constraint follows since 



^ Zi,e = ^2 x iA a )/ 2 - Xe(ot)\{i : Ax i}C > 0}| 

1=1 i:Axi^ e >0 

However, after a augmentations, x e (a) > exp ^ 81o J m ■ c^J > 1. Thus, x e must be in a tight set and so 
by design we never update x e and any □ 

In order to evaluate the performance of the algorithm we first show that the size of the solution returned 
by the algorithm is large as compared to the optimal integral solution. Later in Lemma 13.71 we relate the 
objective value of the solution to its size. This lemma uses crucially the properties of the matroid. 
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Lemma 3.5 (Large Fractional Size). Let (x*(a), z*(a)) be an optimal integral solution to LP2(a). Let 
(x(a), z(a)) be the fractional solution generated by AlgG at the end of the sequence. Then we have, 

Proof. For any element e G E, let P e = {i \ z* e (a) = 1}- By constraint © of the linear formulation 
I -fe| > a/2. Observe that there can be at most j iterations where Axj je (a) > or equivalently Zj >e > 0. 
Otherwise after f updates, the value of x e {a) would be 

1 /81ogm(a/4)\ , 

— exp v 1 ' > 1 (13) 

m z \ a J 

which is a contradiction. By this observation we get that there are two possibilities for any element e such 
that x*(a) = 1. Since a is fixed we use x instead of x(a). 

1. There exists 5 C E, e G S such that x(S) = r(S) (recall that we term such elements "tight"). 

2. There exists at least § iterations i G P e such that Zi^{Si) > ri(Si) — \ for some S% containing e. 

Let n a be the number of elements such that x*(a) = 1. Then, either the first condition holds for n a /2 
elements, or the second condition holds for n a /2 elements. We prove that the Lemma holds in each of these 
options. 

First condition holds for n a /2 elements: In this case, let G be the set of all elements e with x*(a) = 1 
for which there exists a set S containing e with x(S) = r(S). The first condition implies that \G\ > ^f. Let 
r = {S : x(S) = r(S)} be the set of all tight sets. Fact 12. II implies that there is a single maximal set in t, 
say, S. Moreover, each element of G is in one of the sets and G G X(M). Therefore 

Y,Xe>Y,Xe = r{S)>\G\>^ 

Here the first inequality follows by summing only on elements that are in maximal tight sets. The second 
inequality follows since the maximal tight set contain n a /2 elements of the optimal solution that satisfies 
the matroid constraints. 

Second condition holds for n a /2 elements: In this case, we have that there are at least a/Ai G P e there 
exists a set Si C E,e G Si such that Zi(Si) > r»(5i) — 1/2. 
We next define for each iteration i the following set: 

Gi = {e | e is not tight , z* e = 1, Zi, e = 0} 

Then we have, Ya=i > ^ • § an d by Lemma [34l the solution Zi G P(Ni). We now show the following 
claim. 

Claim 3.6. Given a matroid M = (E, I) and a vector z G P(J\f) and a set G G X such that for all e G G, 
there exists a set S containing e such that z(S) > r(S) — \. Then ^2 eeE z e > ^p. 
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Proof. Consider the elements of G in some order. When considering e, increase z e as much as possible 
while ensuring that z remains in P(M). Call the new solution y. Observe that for each e G G, there exists 
a set S such that y(S) = r(S). Consider the set of tight sets r with respect to y and let S be the maximal 
set in r as given by Fact 12.11 Then J^eeEVe > y{S) = r(S) > \G\ since G G 1 and G C 5. But 
X^ge z e — SeGE ^ e ~~ SeeG 2 s i nce onr y variables in G are increased to a fraction of at most half. Thus 
we have ^2 eeE z e > ^ as required. □ 

Applying the claim for each Gj since Gj G X{Mi), we obtain that for each i, 

n 

Y,*i*>\Gi\/2 => EE^^ir 

eGE i=l eGE 

Since Lemma [3~4l implies that for each element e, z «,e < ax e> we have 

a J2eeE x e - Y^e&E Ya=i z i,e > ^f^. which concludes the proof. □ 
Finally, we prove a lemma bounding the performance of the algorithm. 
Lemma 3.7. For any guess value a, the algorithm maintains a fractional solution to LP2(a) such that: 



eG-E i=l \ 6 / 



where OPT a is objective of an optimal integral solution to LP2(a). 

Proof. Let (x*, z*) denote the optimal integral solution to LP2(a). If x* = for each e, then the lemma 
follows immediately. We have the following 



EeGE EILi «i,e(«) > isfc EeGE M«) ~ ^) {Lemma^Z 
> mk^EeeE^-^) (Lemmas 

= ^ ( logm XeGE Z)i=l z i,e( a ) 



where the last equality follows since in LP 2 (a) for each element Ya=i z ie( a ) — ax e anc ^ XeGE x e — 
1. This completes the proof of Lemma. □ 

Finally, we get our main theorem. 

Theorem 3.8. The online algorithm for the fractional LP solution (of LP\) is 0(log m log n) -competitive. 



Proof. The proof follows by combining Lemma (13 -4b . Lemma (13.7I ). Lemma (13.21 ) and the observation that 
there are 0(log n) possible values of a each is guessed with probability f2(l/ log n). □ 
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4 Randomized Rounding Algorithm 



In this section we present a randomized algorithm for the unweighted problem that is 0(log 2 nlogm)- 
competitive when each submodular function fa is a rank function of a matroid. The algorithm is based 
on the fractional solution designed in Section [3] Although our rounding scheme is extremely simple, the 
proof of its correctness involves carefully matching the performance of the rounding algorithm with the 
performance of the fractional algorithm. Indeed, here the fact that LP2(a) has extra constraints not present 
in LP\ is used very crucially. 

Theorem 4.1. The expected profit of the randomized algorithm is fl w )' 

The randomized algorithm follows the following simple rounding procedure. 
Matroid Randomized Rounding Algorithm: 

• F<-h 

• Guess the value a £r {1,2,4..., n}. 

• Run AlgG with value a. 

- Whenever x e increases by Ax e , if F U {e} € I{M) then F ^— F U {e} with probability 

Ale 

^_ 

In order to prove our main theorem, we prove several crucial lemmas. The main idea is to tie the 
performance of the randomized algorithm to the performance of the fractional solution that is generated. In 
the process we lose a factor of 0(log n). We first introduce some notation. All of the following notation is 
with respect to the execution of the online algorithm for a fixed value of a and we omit it from the notation. 
Let Fi denote the solution formed by the randomized algorithm at the end of iteration i and let F denote the 
final solution returned by the algorithm. Let Y" e l denote the indicator random variable that element e has been 
selected till iteration i. Let AY* denote the indicator random variable that element e is selected in iteration 
i. Let yl = Pr[Y* = 1] and Ay l e = Pr[AY£ = 1}. Finally, let y e denote the probability element e is in 
the solution at the end of the execution. Recall that Xj je denotes the value of the variable x e in the fractional 
solution after iteration % and let x e denote the fractional value of element e at the end of the execution of the 
fractional algorithm, and let Axi je be the change in the value of e in iteration i. 

Since the algorithm tosses a coin for element e in iteration i with probability Axj e /4, therefore the 
probability that an element e is included in the solution till iteration i is at most Xj je /4. Our first lemma states 
that the expected number of elements chosen by the algorithm is at least half that amount in expectation and 
is comparable to the total size of the fractional solution. Thus, Lemma 1431 plays the role of Lemma 1331 in 
the analysis of the randomized algorithm. 

Lemma 4.2. Let F be the solution returned by the randomized rounding algorithm, then E\\F\\ = X^ e e£ U e — 

8 

Proof. For every element e in the i th iteration, the algorithm tosses a coin with probability Axj ie /4 > 
and includes it in F if the element shows up in the toss and F U {e} is in T(M) for the current F. We 
maintain a set H in which we include an element which shows up in the toss but cannot be included in F. 
For any iteration i and element e, we have Pr[e € F] + Pr[e G H] = Ax^e/A. Let F denote the final 
set thus obtained. We now do the following random experiment. For every element e € span(F) \ F, we 
toss a coin with probability '^f and include it in a set PL' . Since every element in H must be in span(F), 
by a simple coupling of the random tosses in these two experiments, we obtain \H\ < \H'\. Moreover, 
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< = Yl,e£span(F) x e/4 < \F\ where the expectation is taken over the random tosses used 

for constructing H' since x G Pm an d F G Z(Ad) ■ Taking expectations over the coin tosses used for 
finding F, we obtain E[\H\] < E[\F\]. But E[\F\] + E[\H\] = J2 e eE x e/ 4 - Thus E [\ F W ^ Eee 8 g Xe ■ □ 

Our second lemma relates the change in the probability an element is chosen to the change in the frac- 
tional solution. This lemma shows that a crucial property of the exponential update rule for the fractional 
solution is also satisfied by the integral solution. 

Lemma 4.3. For each element e and iteration i, ^-f- < Ax ' ,e < 24 lo s m _ 

Proof. First, we prove that Ax ' ,e < 8 log m . This follows from the exponential update rule. If a < 81ogm, 
then we have ^ < 1 < Ml2£™. Otherwise, 

«81ogm\ -,\ 
—S-) - 1 ) < 24 log m 

where the second inequality follows since e x : — 1 < 3x for < x < 1. 

We next show that A 2/6 < implying the claim. To show ^ Ve < it suffices to show ^ Ve < 

Ayi . • / • ■ E,<i A ^ 

— whenever ? < z since = — 7. 

Now, observe that for any j, we have Ayi = Pr[e is included in F in the iteration] = Axl ■ 
Pr[Fj^ U {e} G Therefore, = ^M^i-l u M e ^OO] which 

is a decreasing function 

lA.X e 

of j since Fj-i is an increasing set as a function of j. □ 



We next prove a general lemma regarding randomized rounding in any matroid polytope. The proof of 
the lemma utilizes a lemma proven in Chawla et al. llSTl . 

Lemma 4.4. Given a matroid Af = (E,I) and a solution z such that for all S C E, z(S) < r(S)/2, 
construct a set F by including in e G F with probability z e for each e G E independently. Then, with high 
probability (1 — m l n i> ), F can be covered by 0(log m + log n) independent sets where m = \N\. 

Proof. The lemma follows by a coupling argument on the following process. Define the following process: 

• Set S <- E, set h,I 2 , ■ ■ ■ , I m <- 0. 

• As long as 5 is not empty iterate i = 1, 2, . . . , m. 

- In each iteration i, order the elements by any order (we later define a good ordering). 

- Go over the elements in S one-by-one according to the ordering. For each element e, if U U {e} 
is independent in M: 

* Toss a coin for e and include it in Ii with probability z e . 

* S ^S\{e}. 

- Otherwise, skip e in iteration i. 
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Observe that we always remove the first element in the ordering in each iteration from S, so there are at 
most m iterations. Also, note that each I{ is an independent set. We first claim that this process is equivalent 
to the process of tossing a coin for each element independently. That is, for each S' C E: 



Pr[h U I 2 U • • • U I m = S'] = JJ z e J] (1 - z e ) (14) 

eeS" e<£S' 

This is very simple. For each element e, there exists a single iteration 1 < j < m, in which we toss 
a coin for e. In this iteration it is included with probability z e that is independent of all other choices. 
Thus, the process actually defines a covering by independent sets of the process of selecting each element 
independently with probability z e . If we can prove that with high probability S is empty after only a 
logarithmic number of rounds we are done. This might not be true for any ordering, however, we prove that 
there exists some ordering for which this is true. The following lemma essentially follows from Lemma 7 
in El]. We include a proof here for completeness. 

Claim 4.5. Given a matroid J\f = (E,I) with a rank function r and a fractional solution z such that for all 
S C E, x(S) < r(S)/2, there exists an ordering of the elements, such that for all e G E, the probability we 
toss a coin for e is at least 1 /2. 

Proof. We prove that there always an element e that we can put last and if / is the independent set con- 
structed by including rest of the elements in any order / U {e} € X(AT) with probability at least ^. The 
claim then follows from recursing on rest of the elements. 

Suppose we put some e € E last in the order. Let D be the set constructed by selecting each element 
with probability z e . When we reach e, we will toss a coin for e only if e ^ span(D \ {e}). Let p e be the 
probability e is being tossed when it is put last. Thus, 

p e = Pr[e ^ span(D \ {e})] > Pr[e ^ span(D)] 
Let B be any base for the matroid and let k be the rank of the matroid. Then we know that: 



^Pe > ^2 Pr[e (£ span(D)} (15) 

e&B eeB 

= ^Pr[D]{k-rank{D)) 
D 

= k - £[rank(L>)] 

> k - E[\D\] > k/2 (16) 

Inequality ( fT6l > follows since by our assumption -E[|-D|] = Yle&E z e — Averaging we obtain that there 
exists an element in B such that p e >\. We put this element last in the order and recurse. □ 



Suppose in each iteration we choose an ordering given by Claim 1431 Then for each element e G E and 
round i, the probability it "survives" in 5 given that it "survived" previous iterations is at most 1/2. Thus, 
the probability there exists an element in S after 0(log m + log n) iterations is at most 1 / (mn) c for some 
large constant c. □ 
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We now prove a relation between the profit obtained by the algorithm at iteration i, denoted by the 
random variable rj(i^), and the events that a particular set of elements are chosen in the solution. For any i, 
Let Hi denote the set of elements such that Zj e > 0. Note that z^ e > iff Axj je > 0. 

Lemma 4.6. Ya=i E[n{Fi)] > Ya=i Y^e&H, vt> where c is some constant. 

Proof. Let F- = Fi n Hi = {e G Fi : Zi je > 0}. In other words, F[ is the set of elements included by 
the algorithm in iteration i. Observe that each element e is included in F[ with probability at most x^ e /A = 
Zi ye /2 and is dropped sometime even when the toss comes up correct if adding e violates the independence 
constraints for matroid M.. Now consider the set F" where each element is included with probability Zi^/2 
with no regard to the matroid constraints. Coupling the tosses of the two random experiments, we get 
F[ C F" . Since z% € P{Mi) by Lemma l3~4l Lemma |4~41 implies that F" is covered by at most 0(\og(mn)) 
independent sets with high probability 1 — l/(mn) c for some large constant c. Let A, be the event F[ can 
be covered by at most 0(log(mn)) independent sets in Mi, where m = \E\. Therefore, we get that: 



log(mn) 

Let A be the event that for all i, F[ can be covered by 0(log(mn)) independent sets. Then, 



5>[r,(F 4 )] > E[n(F!)} > E E h(F!)\A] • Pr[A] > ^ ' 1 J ■ Pr 



clo&imn) 

-1 i=l <=i 6V ; 



[A] 



> "T— 7 — \ I E iY y Y l\ - nm ■ p r[A] I =0| - ] T Vy' 
clog(mn) \ L ^ ^ eJ L J / \ log(mra) ^ ^ e j 

Here the last inequality follows since 1 < Ya=i YleeHi Ve — 171 ' n an ^ ^ r [A] < - 1 ; for some constant 
d. (If Y27=i Yle&H. < 1 tnen tne statement follows easily). □ 

Now we have all the ingredients to prove Theorem 14.11 
Proof of Theorem 14. It We prove that the expected profit of the algorithm with a guess a is at least 
SI ( log^m Sg n ) • Since each a is guessed with probability 1/logn and the value of OPT is the sum over 
all values a we get the desired. The expected profit of the algorithm when we guess a is at least. 

S=iWl > ^ZtlZeenM {LemmaWM 

= Y2e£E d logmlogn^ e (Xi:eei/; = Ve) 

^ T,eeE sc'iogmiogn ^ (LemmaMS) 
= n ( , " ) = J2 f . QP f° ) (LemmaESl) 

V log m log n / V log m log n / v 1 ' 

□ 



5 Extensions 

In this section, we give simple reductions to deal with various extensions including the case of arbitrary 
weighted rank functions and the case when the algorithm does not the number of arrivals n in advance. 
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Extension to the weighted version 

In this section we show a simple reduction from the weighted rank function case to the unweighted case 
losing an additional factor of 0(log f ratio)- The same reduction should be used when considering the ran- 
domized algorithm in Section [4] We first deal with the case in which f ra tio is known in advance. In this case, 
to solve the weighted version, first guess a value j e {0, 1, . . . , fratio}- Let f min = min j)e .j.( {e } )>0 /i(W). 
Then for each i, replace fi with the following fij which has a restriction on the elements' weights. fij(S) = 
max/cs,/GX(7U) \i e € I, 2J ' fmin < Wi,e < 2 i+1 • /mm} I- It is clear that for any /, and any S: 

f,lS) '-'I- V-fmin-fij(S) 

j — Ojlj'") fratio 

As for each j we get a competitive algorithm, we lose an extra 0(log fratio) due to the guess at the beginning. 

If fratio is unknown, we modify our guess slightly. We guess value 2* with probability ^(log^+i))^ ' 
for a suitable c that depends on e. We then make the same changes as before with this value. It is easy 
to verify that with a suitable c, the sum of probabilities over all values i is at most 1. Moreover, the prob- 

;lass 

0(log fratio (log log f ra tio) 1+e ) due to the guess. 



ability of each weight w class is n — \ rrrr = i — i n — i — i tttt, and so we lose an extra 

J b Mog«)(loglogu>) 1 + e log/ ratio (loglog/ rat i ) 1 +' ! 



Dealing with an unknown n 

In many cases the number of submodular functions n is unknown in advance. We show here a simple 
modification to the algorithm that can handle such cases while losing an additional log log n factor. The 
idea is simply to replace the guessing stage in the algorithm. In the modified guessing stage, we guess value 
a = 2 l with probability ci (i og ^ +i ^i+e > where c is a constant that depends on e. We then run the algorithm 
with our guess. First, note that (with a suitable value c) the total sum of probabilities of all values a is 
bounded. 

^ c ■ i{log(l + i)) 1+e < 1 

To analyze the performance, assume that the number of submodular functions is n, then by Lemma 13771 
for any a < n our profit is Q, ( ^^f ) ■ Thus, the total expected profit of the algorithm is at least: 

^ 1 OPT a > 1 OPT a / OPT 

c • z(log(l + i)) 1+t logm — i ^ c • logn(loglogn) 1+e logm \logmlogn(loglogn) 1+e 

The same strategy also applies to the randomized rounding algorithm. 
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